An understanding - based chinese automatic abstract system in special field 面向特定領域的理解型中文自動文摘系統
Study on an automatic abstract method based on density clustering 密度聚類模式下一種基于層次的自動文摘方法研究
57 li l , zhong y x , guo x h . an understanding - based chinese automatic abstract system in special field . j . computer research and development , 2000 , 37 : 6 - 10 中文語義分析研究在漢語語義表達框架及語義分析方法中文意合網絡概念層次網絡理論邏輯語義廣義配價模式等方面取得了進展。
Finally , based on noun phrase reference algorithm and from the aspects of text macro - constitution and micro - constitution , the text topic sentence abstracting algorithm is given and the author does some trial research on the automatic abstract system ( 2 )在名詞性短語的回指已完成的前提下,我們來提取文本的段落、章節以及全文的特征詞,得到文本段落、章節和全文的特征詞集。
In this thesis , the author first introduces the latest development of automatic summarization system in domestic and abroad , which shows the lack of the automatic summarization system research . then the author introduces some basic concepts about automatic abstract system 在本文中,首先介紹了自然語言處理的基礎概念體系,給出了自然語言處理的定義及其研究和處理的方法和過程,接著便介紹國內外關于自動文摘系統等方面的研究方向和發展動態,并指出了自動文摘系統研究的某些不足。
Secondly , some basic concepts about abstract and automatic abstract system are introduced , and the main formal models and methods of system are compared and analyzed , such as statistics based , meaning based , concept based , knowledge based etc . we induce their characteristics and put forward a kind of comprehensive automatic abstract system . thirdly , the concepts and category of reference of noun phrase are discussed , and noun phrase reference algorithm is introduced . the author also gives the analysis results of the noun phrase reference algorithm 然后我們介紹了文摘和自動文摘系統的基本概念體系,并針對目前幾種主要的自動文摘系統形式化模型和方法:基于統計的機械文摘、基于意義的理解文摘、基于概念的文本結構分析方法和基于知識的文本摘要等模型和方法進行了比較和分析,對它們的優點和缺點進行了討論,歸納出各自的特點,進而在總結各種不同類型的自動文摘系統的特點的基礎上,將基于統計的機械文摘、基于意義的理解文摘和基于概念的文本結構分析方法等三種研究方法相結合,提出了一種綜合型的自動文摘系統的設想。
In this thesis , the author first introduces the latest development of automatic abstract system in domestic and abroad , which shows the lack of the automatic abstract system research . then the author introduces some basic concepts about automatic abstract system 在本文中,我們首先介紹了計算語言學的基礎概念體系,給出了計算語言學的定義以及計算機對自然語言的研究和處理的方法和過程,我們還介紹了國內外關于自動文摘系統等方面的研究方向和發展動態,并指出了自動文摘系統研究的某些不足。
At present , keyword extraction is an important technique used for automatic abstract , automatic classification , subject extraction , subject word extraction etc . the paper introduces a new technique of keyword extraction and key concept extraction based on web page , the design and implement of experimental system , and the application of the system in the search engine 目前,由于計算機在自然語言理解方面還有很大的不足,關鍵詞提取是在進行文本自動摘要、文本自動分類、主題詞提取、主題提取等凡是涉及到文本信息理解的工作時,都要應用到的一項關鍵技術。本論文詳細介紹了一種基于web頁面的關鍵詞與關鍵概念提取技術及其實驗系統的設計與實現,并對該技術在搜索引擎中的應用進行了探討。